Smooth infinite words over n-letter alphabets 
having same remainder when divided by n 



Yun Bao Huang 
Department of Mathematics 
Hangzhou Normal University 
Xiasha Economic Development Area 
Hangzhou, Zhejiang 310036, China 
huangyunbao@sina.com 
huangyunbao@gmail.com 

2010.11.18 

Abstract. Brlek et al. (2008) studied smooth infinite words and established 
some results on letter frequency, recurrence, reversal and complementation for 
2-letter alphabets having same parity. In this paper, we explore smooth infi- 
nite words over n-letter alphabet {ai,a2,--- ,a n }, where a\ < 0,2 < ■ ■ ■ < a n 
are positive integers and have same remainder when divided by n. And let 
04 = n ■ qi + r, qi G N for i = 1, 2, • • • , n, where r = 0, 1, 2, ■ ■ ■ , n — 1. We 
use distinct methods to prove that (1) if r = 0, the letters frequency of two 
times differentiable well-proportioned infinite words is 1 /n, which suggests that 
the letter frequency of the generalized Kolakoski sequences is 1/2 for 2-letter 
even alphabets; (2) the smooth infinite words are recurrent; (3) if r = or 
r > and n is an even number, the generalized Kolakoski words are uniformly 
recurrent for the alphabet S n with the cyclic order; (4) the factor set of three 
times differentiable infinite words is not closed under any nonidentical permu- 
tation. Brlek et al.'s results are only the special cases of our corresponding 
results. 

Keywords: Smooth infinite words; reversal; recurrence; uniformly recurrence; 
letter frequency; well-proportioned infinite words. 
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1. Introduction 



The Kolakoski sequence K which was first introduced in [19], is the infinite sequence 
over the alphabet S = {1, 2} which starts with 1 and equals the sequence defined by 
its run lengths: 

122112122122 

Here, a run is a maximal subsequence of consecutive identical symbols. The curious 
Kolakoski sequence K has received a striking attention by showing some intriguing 
combinatorical properties [3, 6, 7, 8, 10, 14, 15, 16, 20, 21, 23, 26, 27]. 

Keane [17] asked whether the density of l's in K is 0.5. Chvatal [9] proved that 
the upper density of l's as well as the upper density of 2's in K is less than 0.501. 
Brlek, Jamet and Paquin [4] investigated smooth infinite words on 2-letter alphabets 
having same parity and showed that all smooth infinite words are recurrent; that the 
closure of the set of factors under reversal holds for odd alphabets only; and that the 
frequency (density) of letters in extremal words is 1/2 for even alphabets, and for 
a = 1 with b odd, the frequency of &'s is 1/ (V2& — 1 + 1). 

Baake and Sing [1] and Sing [24, 25] established a connection between the gen- 
eralized Kolakoski words and model sets. By the way, Sing [24, 25] showed that for 
2-letter alphabets having same parity, the generalized kolakoski sequences are the 
fixed points of some suitable primitive substitutions, which means the generalized 
kolakoski sequences are uniformly recurrent. 

In this paper, our main goal is to study the corresponding problems on n-letter 
alphabets having same remainder r when divided by n. By using distinct methods, 
we give all the fixpoints of the operator A (Theorem 4) and establish that (1) smooth 
infinite words are recurrent (Theorem 18); (2) if the remainder r is or r > and n is 
an even number, the generalized Kolakoski words are uniformly recurrent for n-letter 
alphabets with the cyclic order (Theorem 20); (3) if r = 0, the letters frequency is 
1/n for two times differentiable well-proportioned words (Theorem 7), which means 
that the frequency of the generalized Kolakoski sequences is 1/2 for 2-letter even 
alphabets (Corollary 8); (4) the factor set of three times differentiable infinite words 
is not closed under any nonidentical permutation (Theorem 10). Moreover we provide 
a new proof of smooth infinite words being closed under reversal for odd alphabets. 

The paper is organized as follows. In Section 2, we shall first fix some notations and 
introduce some notions. Secondly in Section 3, we give all the fixpoints of the operator 
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A over E n . In Section 4, we establish the frequency of two times different iable well- 
proportioned words for n- letter alphabets having the remainder r = 0. In Section 
5, we show that three times different iable infinite words are not closed under any 
nonidentical permutation for n-letter alphabets. In Section 6, we give some useful 
notations of the inverse $ _1 of the bijection $ : C% — > E^. In Section 7, we establish 
some lemmas which are used in the following discussions. In Section 8, we prove 
that smooth infinite words are reccurent for n-letter alphabet with the cyclic order. 
In Section 9, we establish that if the remainder r is 0, or r > and n is an even 
number, then the generalized Kolakoski sequence is the fixpoint of some suitable 
primitive substitution for n-letter alphabets having same remainder r with the cyclic 
order, which means that the generalized Kolakoski sequence is uniformly reccurent. 
In Section 10, we give a new proof of smooth infinite words being closed under reversal 
for 2-letter odd alphabets. Finally we end this paper with some concluding remarks 
in Section 11. 

2. Definitions and notation 

Let E„ = {ai, 02, ■ ■ ■ , a n }, where a\, 02, • ■ • ,a n are positive integers with a\ < ai < 
■ ■ ■ < a n , E* denotes the free monoid over E re . A finite word over E n is a member of 
E* . If w = W1W2 ■ ■ ■ Wki Wi G E n for i = 1, 2, ■ ■ • , k then k is called the length of the 
word w and is denoted by Sometimes we also use w[i] to denote the ith letter 
Wi of the word w, that is, w = w[l]w[2] ■ ■ -w[k], and w[i . . . j] = w[i]w[i + 1] • • -w[j] 
for 1 < i < j < k. If |i*;|=0 then w is called the empty word and is denoted by e. 
Let \w\ a be the number of a which occurs in w, where a G E n , then \w\ = E™ =1 |u>| ai . 
Moreover E+ denotes E* - {e}, E^ = {w G E* : \w\ = k} for k = 0, 1, 2, • • • . 

The set of all right infinite words is denoted by E^, and E^° = E^ U E* . Given a 
word w G E£°, a factor u of w is a word u G E^° such that w=xuy for x, y G E^°, and 
F(w) denotes the set of all factors of w, F^w) = F(w) D E*. If x = e (resp. y — e) 
then u is called prefix (resp. suffix). A run (or block) is a maximal factor of form 
u = a k ,a G E„. Pref(w) denotes the set of all prefixes of w, Pref n (w) denotes the 
prefix of length n of w. Finally denotes the set of all natural numbers, M* and 
denote the free monoid and the set of all right infinite words over J\[ respectively, 
where N is the set of positive integers. 

Let u = UiU 2 ---Uk G E*, where Ui G E n , i = 1,2, •••n fc . The reversal of u 
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is the word u = Uk ■ ■ - U2U\. A palindrome is a word P such that P = P. Let 
u = U1U2 • • • G E^°, where Uj, G E n , % = 1, 2, • • • . The permutation o~{u) of u is the 
word cr(u) = o~{ui)o~{u2) ■ • • , where a is a permutation of E n . Especially, if n = 2, then 
E2 has an unique nonidentical permutation - , which is determined by a\ — 02, 02 = ai 
and is also said to be the complement. 

An infinite word w is recurrent if every factor has infinitely many occurrences. 
And an infinite word w is uniformly recurrent if every factor appears infinitely often 
with bounded gap. 

We see that every word w G E£° can be exclusively written as a product of factors 
as follows: 

w = a^a^a^ • • • , where atj G E n , atj 7^ a J+ i andz^ G A/" U {w} for j G N . (2.1) 

From (2.1) it is obvious that w is uniquely determined by the sequences ri J >i^ an d 
YLj>i a j> which are respectively said to be the base sequence and power index sequence 
of w, and be respectively denoted by A(w) and B{w). That is 

A : E* -> A/"*, defined by 

AH =W3-- = [I^ ( 2 - 2 ) 

fe>i 

B : E; -> E;, defined by 

B(w) = aia 2 a 3 ■ ■ • = Y\ a k, (2-3) 

fe>i 

which is easily extended to E^ (also refers to [2]). From (2.1-2.3) it immediately 
follows that 

w G E* is a palindrome <^=^ A(w) and B(w) are both palindromes. (2-4) 

The function (A, B) gives the run-length coding {RLE) on E* , which is a very simple 
form of data compression in which runs of data are stored as a single data value and 
count, rather than as the original run. Run- length encoding performs lossless data 
compression and is well suited to palette-based iconic images. For example, it is used 
in fax machines and is relatively efficient because most faxed documents are mostly 
white space, with occasional interruptions of black. 

Remark 1: If w G E^ in (2.1) and = k, then a& = uj. For example, w = 

2 3 4 2 3 4 4 aJ G {2,3,4}^, then A(w) = 324a; and B(w) = 2434. 
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For any w G E£°, first (w) denotes the first letter of the word w. For each w G E* , 
last(w) denotes the last letter of the word w. It is clear that the operator A satisfies 
the property: A(uv) = A(u)A(v) if and only if last(u) 7^ first(v). 

Since A (10) is independent of the choice of the base sequence B(w), the function 
A is not injective because A(aw) = A(w) for every word w and any permutation a of 
£„. For instance, if w = 2 2 1 3 3 5 7 6 (4 3 5 2 6 4 ) 1000 then A(w) = 2356(324) 1000 and B(w) = 
2137(456) 1000 . In fact, over E 7 = {1, 2, 3, 4, 5, 6, 7}, there are 7 ■ 6 1003 different choices 
of the base sequence B(w) having the same power exponent sequence 2356(324) 1000 
with the word w. In general, for each word w over E n (n > 2), if |A(w)| = k, then 
there are n(n — l) fe_1 different words over E n having the same power index sequence 
A (it;) with the word it;. 

Let <B n , = {u G E^ : u { ^ u i+1 for i G M}. If n = 2 then Q3 2 = {(aia 2 ) w , (a 2 «i) w }; 
if n > 2 then |Q3 n | — 00. Thus, given a sequence B G *B n , the pseudo-inverse function 

A^ 1 : E£° -)• E£°, it = UiU 2 u 3 U4 ■ ■ ■ 
can be defined by 

» 1/ n f VfVfbf ■ ■ ■ , if it G E"; 

A„ m = ^ 1 2 3 2.5 
\ b u ,% 2 ■■■b u k k , if |u| = fc. 

Then the following property is immediate: 

Vit G E~ : jA^u) = A~£(u), where a is a permutation of E n ; 

Vugs;: a7(S) = a 5 1 H. 

Now we first generalize the notion of finite different iable word over the alphabet 
{1, 2}, which was given first by Dekking [11], to the one of infinite different iable word 
over any n- letter alphabet E n . 

Definition 1. Let it; = a* 1 ^ 2 ■••GSjJ and k be a fixed positive integer. 

(1) If A l (w) G for % = 1,2, ■•• ,k, then we call 10 a k-times differentiable 
infinite word over E n ; 

(2) If A k {w) G E^ for any positive integer k then we call 10 a C^-word (or a 
smooth infinite word) over E n . 

For example, let w = (3 3 l 3 3 3 1313 3 l 3 3 3 13 3 13 3 l 3 3 3 1313 3 l 3 3 3 13 3 l 3 3 3 l) w then w is a 
4-times differentiable word, but it is not a 5-times differentiable word over {1,3}. 

Secondly we extend the notion of differentiable words over the alphabet {1, 2} to 
over arbitrary n- letters alphabet E n . 
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Definition 2. Let w G £* and u> = a^ct^ 2 • • • where «j G E n and ojj 7^ ccj+i for 1 < 
% < k — 1, and 1 < tj < a n for % — 1, 2, • • • , k. 

If ti G E n for f = 2, 3, ■ • • , k — 1, then we call that w is differentiable, and its 
derivative, denoted by D(w), is the word whose jth symbol equals the length of the 
jth run of w, discarding the first and/or the last run if its length is less than a n . 

Let w G S* and k be a fixed positive integer, if D h (w) G S* then we call w a 
k-times differentiable word over E n , or a C^-word. If a finite word u; is arbitrarily 
often differentiable, then we call w a C^-word, or a smooth word. Obviously, if w is 
a smooth word and |u>| > 0, then |.D(u>)| < |u?|. Moreover, it is clear that D is an 
operator from S* to S* and 

yz, where y + z > 1, y, z < a n or w = e 
a n xa n or A (it;) = a n , 
-Dfw;) = < xa„. A(w) = yxa n and 1 < y < a n (2.6) 

a n xz and 1 < z < a n 
yxz and 1 < y, z < a n 

It is easy to see that finite factors of smooth infinite words are all C^-words. Thus 
finite smooth words [2], which are finite factors of smooth infinite words, are always 
C°°-words. 

b 

Let w = a b ~ l a b a b ■ ■ ■ f3 b f3 b ~ l G where if 2 | b then (3 = a, otherwise (3 = a, 
if b — a > 1 then w can not occur in any smooth infinite word, so C°°-words are not 
always finite smooth words. 

In what follows, we use C% and to stand for the sets of all /c-times differentiable 
finite words and all fc-times differentiable infinite words respectively. 

It is easy to check that A and D all commute with the mirror image (~) and are 
stable for every permutation a over the alphabet S n . Thus Proposition 4 in [5] still 
holds for arbitrary n-letter alphabets. 

Lemma 3. Let a be a permutation of S n , then 

(1) For all u G E* n , D(u) = D(u), D(au) = D(u); 

(2) For all u G A(u) = A(u), A(tru) = A(«) ( [2] Proposition 1 (a)-(b)). □ 

These properties indicate that C^°, C%, C\ and C* w are all closed under these 
operators: 



w G C£° crw, !cGC 



00 . 



6 



weC k n 
w e C% 



3. Fixpoints of the operator A 

The operator A over (ai = a, a 2 = b) has exactly two fixpoints, that is A(K a ^) = 
K at b, A(Kb ta ) = Kb t a, where K a b (otK^o) is an infinite sequence over the alphabet 
£2 which starts with a (or 6) and equals the sequence defined by its run lengths: if 
a > 1 then 

Ka,b — Q a y ■• • a a y p! b a b ^ ■ ■ b\ - ■ ■ , where a = b if a is even, otherwise a = a, 

a a 

Kb,a = b b a b ■ ■ ■ (3 b (3 a a a ■ ■ ■ a a ■ ■ ■ , where (3 = a if b is even, or else (3 = b, 

b b 

which are called the generalized Kolakoski sequences. 

If n > 2, then the operator A over has infinitely many fixpoints (see [2] 
Concluding remarks). In fact, we can determine all the fixpoints of the operator 
A. To do so, let Fix (A) denote the set of all fixed points of the operator A and 
V = y^VzVl 3 ' ' ' e Fix (A), where y x y 2 ■ ■ ■ y» • • • G Q5 n . Then by y = A(y), we have 

t\ = t 2 = ■ ■ ■ = t yi = yi, t yi+ i = t yi+ 2 = ■ ■ ■ = t yi +y 2 = 1/2 • • • , ^ (2/1 -1) = 

*(in-i)-i/i+2 = • • • = t yi . yi = y yi , • • • , it immediately follows that 

Theorem 4. Let S n = {a 1 ,a 2 ,--- ,a n }, where a\ < a 2 < ■ ■ ■ < a n G Af. Then 
Fix(A) = {K u : u E <B n } and 

IT — U! _ Ul U 2 U 2 . ..,,"2 ...„""! . . . ,,""1 ... -H 

XV U — U 2 "ui "«i+l"«i+2 "ui+ui "(tii-l)-tti+l "«r«l ) K - 1 ) 

where u = Uiu 2 u 3 • • • . □ 

The sequences satisfying (3.1) are said to be the generalized Kolakoski sequences. 
Theorem 4 suggests that if n > 2 then there are infinitely many generalized Kolokoski 
sequences over the n- letters alphabet E n . Moreover, it is obvious that the generalized 
Kolakoski word K u over S n is a smooth infinite word for every u G 

4. Letter frequencies 

Keane [17] put forward whether the density of l's in K is 0.5. It is still a challenging 
problem. Actually, the best upper density of l's as well as the upper density of 2's in 
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K is 0.50084 and is due to Chvatal [9]. Brlek, Jamet and Paquin [4] showed that the 
frequency of letters in extremal words is 1/2 for even alphabets, and for a = 1 with b 



odd, the frequency of fo's is l/(y/2b — 1 + 1). Hereinafter we establish the frequency of 
letters in 2-times differentiable well-proportioned infinite words for arbitrary n-letters 
alphabets with each member being a multiple of n. 

Lemma 5. Let u = a^a^ 2 ■ ■ ■ a£ , where ojj, tj G £ n = {ai, a 2 , • • • , a n }, 1 < i < k, 
and n \ a,- for j = 1, 2, • • ■ , n. And let m — \u\ — S*L 1 tj, then for any v = v±v 2 ■ ■ ■ v m G 
E* such that {^.„+i, Vi. n+2 , • • ■ , Vi. n+n } = {ai, a 2 , • ■ ■ , a n ,}, where z = 0, 1, ■ • • ,p and 
m = n ■ p, one has 

|A~ m O) | aj = 1 ^ 1 Jl for i = 1, 2, ■ • • , n. (4.1) 
n 

Proof. Since t±, t 2 , ■ ■ ■ , i& G S n and n | a i; we see that ti — n-Si for i = 1, 2, • • • , k. 
Thus 

ti 



(Mj - V 2 V n V n+1 V n+2 V n+n W ( Sl _i) n+ i^( Sl _i) n+2 V ( Sl -l) n+n 



t-2 



u t 1 +l u t 1 +2 u t 1 +n u t 1 +n+l u t 1 +n+2 u tx+2-n u q-n+l u q-n+2 u q 



h+2 (/ ft+n"(i+n+l l/ /i+n+2 w /i+2-n u m-n+l u m-n+2 u m • 

(4.2) 

where g = t± + t 2 , h = t 1 + t 2 + • • • + ifc-i- Since n | t 3 - for j = 1, 2, • • • , n and 
{^+1,^+2, •• • ,^-n+n} = {ai,a 2 , •■• ,a n } for i = 0, 1, ■ ■ • ,p, we easily see that 
there are equal numbers of a"% a 2 \ ■ ■ • , and a"' in the subsequence 

U 



^+1^+2 ' ' ' V b l +n V b 1 +n+l V b 1 +n+2 ' ' ' V bi+2-n ' ' ' V bi+t k -n+l V bi+t u -n+2 ' ' ' V bi+t k 



for i — 1, 2, • • • , /c, where G £„, &i = and 6j = ti+t 2 +- ■ for i = 2, 3, • ■ ■ , A;. 

Therefore there are equal numbers of a"% a 2 % • • • , and a"* in the right side of (4.2), 
which suggest that (4.1) holds. □ 

Remark 2: From the proof of Lemma 5, we easily see that Lemma 5 also holds if 
the condition ti G E n is substituted by ti being divided by n. 
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Definition 6. Let w G If the base sequence B(w) = a\a 2 ■ ■ ■ at - ■ ■ satisfies 

{aci. n+1 , ai. n+2 , ■■■ , ai. n+n } = {a 1: a 2 , ■ ■ ■ , a n } for Vz G A/", (4.3) 

then w is said to be well-proportioned. 

Now we are in a position to prove our second main result. 

Theorem 7. Let w be a two times differentiate well-proportioned infinite word over 
the n-letters alphabet E n = {a±, a 2 , ■ ■ ■ , a n }. If n | a, for j — 1, 2, • ■ • , n, then 

liml^f = ifori = l,2,..-,fi, (4.4) 

where — w[l ■ ■ ■ k] = Prefk(w). 
Proof. Since A 2 (w) G E^ we have 

A(w) = 7^ 7 * 2 • • ■ ^; ■ ■ ■ , where 7i , t 4 G E„ for z = 1, 2, - - - . (4.5) 

Thus (4.5) suggests 

»=limA B 1 (AH), (4.6) 

where B(w)i = Prefi(B(w)), A(w)i = Prefi(A(w)). Therefore from (4.6) we see that 
for each k G A/", there is a positive integer z such that 

W k = A-l w)i {A{w)i)v, where v G E* and |v| < (4.7) 

Now (4.3), (4.7) and (4.1) imply that (4.4) holds. □ 

Since infinite words over 2-letter alphabets are always well-proportioned, from 
Theorem 7 we can deduce 

Corollary 8. The frequency of letters in 2-times differentiable infinite words is 1/2 
for 2-letter even alphabets. □ 

Furthermore from Theorem 7 it immediately follows that 

Corollary 9. If n | aj for i = 1,2, •■■ , n, then the frequency of letters in well- 
proportioned smooth infinite words is 1/n for the alphabet E n . □ 
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5. Permutation 



Kimberling [18] first raised whether or not the complement of a finite factor of Ko- 
lakoski sequence K is still a factor of K and whether or not every finite factor of K 
occurs infinitely often (recurrence). Dekking [11] showed that the closure of F(K) 
by complementation would imply the recurrence property. These conjectures were 
stated for every smooth infinite word over {1, 2} in [3]. Brlek etc. [5] proved that the 
existence of arbitrarily long palindromes in infinite smooth words on {1,2} would im- 
ply the recurrence property. Recently Brlek etc. [4] showed that the closure of F(w) 
under reversal holds for odd alphabets only ([4] Proposition 15) and that the set of 
factors of extremal smooth words is not closed under reversal and under complemen- 
tation ([4] Proposition 26) for even alphabets. Corollary 9 (iii) in [4] implies that the 
set of factors of extremal smooth words on {1,3} is not closed under complementa- 
tion. Next we establish the corresponding result for n-letter alphabets having same 
remainder when divided by n. And the proof seems to be slightly more essential and 
straight forward . 

Theorem 10. Let S n = {a±, a 2 , • • • , a n }, where a\ < a 2 < ■ ■ ■ < a n G M and a, = 
r (mod n) for % = 1, 2, • • • , n, where r = 0, 1, • • • , n — 1. Suppose that w G C^j w and 
B(w) = (hb 2 ---b n )", B(A(w)) = (c lC2 ---c n r, B(A 2 (w)) = ( ei e 2 ---e n r, where 
bib 2 ■ ■ ■ b n , c\c 2 ■ ■ ■ c n and t\t 2 ■ - ■ e„ are arrangements of a\, a 2 , ■ ■ • , a n . Then F(w) is 
not closed under any nonidentical permutation. 

Proof. Since w G and B(A 2 (w)) = {t\t 2 ■ ■ ■ e n Y , we have 

A 2 (w) = e^e* 2 ■ ■ • e%e\ n+1 e 2 n+2 ■ ■ ■ e t2 " ■ ■ ■ ef n+1 ef n+2 ■ ■ ■ el (fc+1) " 1 • • • (5.1) 

where tj G £ n for i G N. 

From d, U G E n , we get ej = n-qi+r, qi G N for i = 1, 2, • • • , n; U = n-hi+r, hi G 
N for i — 1, 2, • • • , where r = 0, 1, • • • , n — 1. Now let c n -k+i — Q for i — 1,2, ■ ■ • ,n 
and k G N, then note that c n .k = c n , by B(A(w)) = {c\C 2 ■ ■ ■ c n ) w and (5.1), we can 
obtain 

A(w) = (c 1 1 c 2 1 ■ • • c n x ) 1 c 1 1 c 2 1 ■ ■ • c r 1 (c r 2 fl c r 2 f2 ■ • • c r 2 ) 2 c r 2 +1 c T 2 +2 ■ ■ ■ c 2r ■ ■ ■ 

V°(n-l)r+l°(n-l)r+2 ^{n-l)r> u (n-l)r+l^{n-l)r+2 ^n-r 

(r ei r ei n ei\h n+1 e 1 e ± e\ ( e 2 e 2 „e 2 \h n+2 e 2 e 2 e 2 

\ L 1 c 2 c n J c l c 2 c r \ L r+l c r+2 c r ) c r+l c r+2 L 2r 

V L (n-l)r-+l°(n-l)r+2 °(n-l)r/ °(n-l)r+l (n.-l)r+2 °n-r W'^V 
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Let b n .i + j = bj for j = 1, 2, • • • , n and % G A/", then note that b n .i = b n , from B(w ) 
(61 62 " " 'bn) 10 and (5.2) it immediately follows that 



,C2 
2r 



« = [( W • • ■ WW • • • 6? (^ 2 +2 • • • v?TVM%.2 ■ ■ ■ b 

1'<1 

^ W (n-l)r+l W (n-l)r+2 ' ' ' W (n-l)rJ °(n-l)r+l W (n-l)r+2 ' ' ' W n-rJ 

(b Cl b Cl ■ ■ - b Cl \ qi b Cl b Cl ■ ■ - b Cl (b C2 b c ' 2 ■■■h C2 ) qi h C2 b c ' 2 ■ ■ ■ h° 2 ■ ■ ■ 

\ u l °2 u n ) u l u 2 u r \ u r+l u r+2 u r ) u r+l u r+2 u 2 r 

llf r ic r ...h Cr ) qi h Cr b Cr ■■■b Cr 

\ U {r-l)r+l U (r-l)r+2 U (r-l)r> b (r-l)r+l C (r-l)r+2 r 2 

[(b Cr +\ ■ ■ ■ b Cr 2 +1 ) q2 b c z\ ■ ■ ■ b% + 2 (b c \ + 2 ^ • • • b% + 2 ) q2 b c V2 ^ • • • b% +2 ■ ■ ■ 

(h Cr ■■■b Cr \ q2 h Cr ...h Cr ] h2 

\ U r 2 + (n-l)r+l U r 2 +(n-l)rJ U r 2 +(n-l)r+l U r 2 \ 

(b Cr +\ ■ ■ ■ b Cr 2 +1 ) q2 b c Z\ ■ ■ ■ b Cr 2 + 2 (b Cr 2 + 2 ^ • • • b Cr 2 + 2 ) q2 b Cr 2 + 2 ^ • • • b Cr +\ • • • 

(h C2r ■ . . h° 2r \Q2tf2r . . . yflr . . . 

y U r 2 + (r-l)r+l U r 2 +(r-l)r> U r 2 +(r-l)r+l U 2r 2 

r/7 C(n-l)r+l L C (i-l)'-+l\gnI ) C ( rl - 1 ) r + 1 7 C (n-l)r + l 

U%-l)r 2 +l ' ' ' U (n-l)r 2 ) U (n-l)r 2 +l ' ' ' U (n-l)r 2 +r 
/,C(„_ 1 ) r+2 ,C (n _ 1 ) r+2 < g .C( n _ 1)r+2 ,c (n _ 1)r+2 

V°(n-l)r 2 +r+i ' ' ' °(n-l)r 2 +W °( n -l)r 2 +r+l ' ' ' °{n-l)r 2 +2-r 

(l c {n-l)r L C 0~l)r \ q n L C ("- l)r h C (ri-l)r 1 fe n 

^(n-l)r 2 + (n-l)r+l ' ' ' %-l)r 2 + (n-l)r/ U (n-l)r 2 +(n-l)r+l ' ' ' W (n-l)r 2 J 

/7 C (n-l)r+l L C ("-l)'-+l\qnI ) C (™- 1 ) r + 1 L C (n-l)r+l 

^(n-l)r 2 +l ' ' ' °{n~l)r 2 ) U {n-l)r 2 +l ' ' ' U {n-l)r 2 +r 

(l C (n-l)r+2 ,C( n _ 1 ) r+2 \g„i C(n-l)r+2 ,C( n _i) r + 2 

V°(n-l)r 2 +r+l ' ' ' °(n-l)r 2 +W °(n-l)r 2 +r+l ' ' ' °{n-l)r 2 +2-r 

(U c nr IXnr ~L.C nr / [r o\ 

\ u (n-l)r 2 +(r-l)r+l ' ' ' U (n-l)r 2 +(r-l)r> u (n-l)r 2 +(r-l)r+l ' ' ' U nr 2 ' ' ' 

From (5.2, 5.3) we see that the maximal factors of w, which are composed of 
consecutive runs of the same length, must be one of the following forms. 

e j 



(lPti-l)r+l t C 0-l)r+l \qj J C (j-l)r+l L C 0-l)r + l 

^ %'-l)r 2 +l ' ' ' U {j-l)r 2 +n> %-l)r 2 +l ' ' ' U (j-l)r 2 +r' 
^ 



(lf(3-l-)r + 2 7 C (j-l>+2 \qjl C (]-l)r + 2 ,CQ-!) r+ 2 

\ U {j-l)r 2 +r+l ' ' ' U {j-l)r 2 +rJ U (j-l)r 2 +r+l ' ' ' %-l)r 2 +2-r' 



(U C U- 1 ) r + n J c (j-l)r + n \qj l c (j-l)r + n 7 C (j-l)r + 7i ( K A~\ 

\ %'-l> 2 +(n-l)r+l ' ' ' U (j-l)r 2 +(n-l)r/ U (j-l)r 2 +(n.-l)r+l ' ' ' U (j-l)r 2 +n-r' \°-*J 

where j = 1, 2, • • • , n. 

Note that by (5.4), the length of all possible maximal factors of w, which is 
composed of consecutive runs of the same length, is respectively equal to Cj for j = 
1, 2, • • ■ , n. Since e\, ■ ■ ■ ,e n is an arrangement of a\, a2, ■ ■ ■ , a n , there is some 
k e M such that e& = a n . Thus on the one hand, since e& = a n , taking j — k in (5.4), 
we get 
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c (fe-l)r + l 7 C (&-1)t-+1 \q k 7 C (fe-l)r + l L C (fc-l)r+l 

(fc-l)r 2 +l ' ' ' °(fc-l)r 2 W °(fc-l)r 2 +l ' ' ' °(fc-l)r 2 +r> 



C( fe _i )r+ 2 ,C (fe _ 1)r+ 2 C( fc _i) r+2 ,C (fe _i )r+ 2 

(fc-l)r 2 +r+l ' ' ' U (k~l)r 2 +r) (fc-l)r 2 +r+l ' ' ' °(fc-l)r 2 +2-r' 



/ " V 

/7 C (fc-l)r+n ,C (fe „ 1)r + n > qk , C (fe _ 1)r+ „ jC(fc-l)r + 7i / r r\ 

V U (fe-l)r 2 +(n-l)r+l ' ' ' "(fc-l)r 2 +(n-l)W u (k-l)r 2 +(n-l)r+l ' ' ' U (fc-l)r 2 +n-r ' °> 

On the other hand, since a n > aj for j < n, from (5.4) we see that the max- 
imal factors of the word w, which are composed of consecutive runs of the same 
length and are of the greatest length, have all occurred in (5.5). Therefore, from 
C(fc_i) r+ i, C(fc_i) r+2 , • • • , C(fc_i) r+n being an arrangement of ai, a 2 , ■ ■ • , a n , it follows that 
for any nonidentical permutation a of E n , the image of each member of (5.5) under 
the permutation a cannot occur in (5.5), which means that w is not closed under any 
nonidentical permutation. □ 

From the proof of Theorem 10, we easily see that two times differentiable infinite 
words are not closed under complementation for 2-letter alphabets having same parity. 



6. The bijection $ from to 

From now on, similar to [2] concluding remarks, we need to assume a cyclic order 
bib2 • • • b n of the alphabet E n such that the base sequence of every word is com- 
patible with this order, where bi,b 2 ,--- ,b n is a fixed arrangement of the elements 
ai,a 2 , ■ ■ • , a n of the alphabet E n . And suppose that b t . n+i = b$ for any t G N and 
i = 1,2, •• ■ ,n. Note that if n = 2, then aia2 is the unique cyclic order of E 2 . In 
fact, there are exactly (n — 1)! different cyclic orders of E n . Let 

C k n = {w G C k n : B(w) is compatible with bib 2 ■ • • b n }; (6.1) 
= {w G C£° : B(w) is compatible with b^ • • • b n }; (6.2) 
C = {w G CI : B(w) is compatible with bib 2 ■ • • b n }. (6.3) 

Remark 3: Note that C\ = Cf = C 2 °°, C$ = C%. 

In what follows, the pseudo-inverse function A" 1 (a G E n ) is compatible with the 
cyclic order bib 2 • • • b n of the alphabet E n . Let p = a±a 2 ■ ■ ■ oik-, where a\, a 2 , ■ ■ ■ ,ctk 
G E n , we use A~ k {u) to denote A~* A~* • • • A~Hu) . Clearly, if p = p\p 2 then 
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For example, given a cyclic order 243 of the alphabet {2,3,4}, then 
A^(24) = 2 2 4 4 ; 

A23 3 2 (24) = 2 3 4 3 3 2 2 2 4 4 3 4 2 4 4 4 3 3 2 3 4 3 3 3 2 2 4 2 3 2 2 2 4 4 3 4 2 4 4 4 
= A 2 - 1 A 32 2 (24) 
= A 23 2 A 2 - 1 (24). 

In order to study infinitely often differentiable infinite words of any period over {1,2}, 
Dekking [12] established a bijection from to E 2 . Similarly, we can define a bijection 
from C£ to as below: 

which is constructed by setting 

$(»[j + 1] = A J '(w)[l], for j > 0, (6.4) 
and its inverse is defined as follows: 

lim A^ (u[k]). (6.5) 

k— >oo 1 J 

In the sequel, for any u E E°° and 1 < % < Ui denotes u[l • ■ -i] and 

^\u[l---k}) = A:{i^_ 1] (u[k}), (6.6) 

which determines a function from E+ to E+. Then by (6.5-6.6), we get 

= lim $ _1 (u[l ■■•£]) (6.7) 
= lim ^(f/fc). (6.8) 

Obviously, K\ a = $~ 1 (6 W ) and # a , 6 = ^(a"). 



7. Some Lemmas 

The following simple results are important in the sequel. First of all, from the defini- 
tion of the operator A" 1 (a 6 E„) it immediately follows that 

Lemma 11. Let u E E+, v E E+ (E£) and a E E,„. 

(1) If \u\ = q ■ n + k, where q E M and k — 0, 1, ■ ■ • , n — 1, then 
A~ 1 (uu) = A~ 1 (m)A^ 1 (w), where a = and bj = b i+k ; 

(2) If n | then 
4 1 H=A- 1 («)A^ 1 M. □ 
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The following Lemma 12 (1) is a generalization of Brlek et al. [4] Lemma 21. 

Lemma 12. Let w G X+, a G E n . 

(1) If a\, a,2, ■ ■ ■ , and a n when divided by n have the same remainder and \w\ is a 
multiple of n, then the length of A~ 1 (w) is also a multiple of n; 

(2) If n = 2, ai, a 2 are odd integers and w has odd length, then A" 1 ^) also has 
odd length; 

(3) If n — 2, ai, a 2 are odd integers and u> is a palindrome of odd length, then 
A~ 1 (w) is also a palindrome of odd length. 

Proof. (1) Let w = ai« 2 • • ■ where k G M and Oij G S„ for z G A/", then 
A~ 1 (w) = a ai a a2 ■ • ■a OLn h . Thus since all a±, • ■ ■ , and a n -k when divided by n 
have the same remainder, we have 

n-k n-k n-k 

\ A a l ( w )\ = E«i = E( n •<?; + ?") =w (Eft + 

i=l i=l i=l 

where ccj = n • + r, < r < n — 1, which suggests that the length of A~ l (w) is a 
multiple of n; 

(2) Let u> = ai« 2 • • • «2fc+i, where ccj G S 2 , then since A~ 1 (w) = a ai a a2 • • • a a2k+1 

2k+l 

and all «i, a 2 , • • • a 2 k+i are odd integers, we see that |A~ 1 (w)| = «j is also an 

i=l 

odd integer; 

(3) Note that if w has odd length, then it is clear that A~ 1 (w) = A~ 1 (w). Thus if 
w is a palindrome of odd length, then A~ l (w) is also a palindrome of odd length. □ 

From Lemmas 11-12, it easily follows that 

Lemma 13. Let w = uv, p = aia 2 ■ ■ ■ ak, where u G £+, v G a.j G E n , z = 

1,2,- .-A;. 

(1) If ai, d2, ■ • ■ , and a n when divided by n have the same remainder and \u\ is a 
multiple of n, then 

A~ k (uv) = A- k (u)A- k (v); 

(2) If n = 2, ai, a 2 are odd integers and u has odd length, then 
A- k (uv) = A; k (u)A p k (v). □ 

Furthermore from Lemma 13 immediately follows: 

Lemma 14. Let w = V\v 2 ■ ■ ■ v n , p = a\a 2 ■ ■ ■ ak, where Vj G E+, v n G CKj G 

E n , l<j<?2 — 1, l<i<A;. 
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(1) If ax, a,2, • • • and a n when divided by n have the same remainder and \vi\ is a 
multiple of n for i = 1, 2, . . . , n — 1, then 

A;» = A^OA;^) . ■ ■ A; fc K); (7.1) 

(2) If n = 2 and ax, ai are odd integers and all Vj (1 < j < n — 1) have odd 
length, then 

A;» = A p -^i)Ar fc (*, 2 ) . . . A -^ n ), (7.2) 

where if 2 | n then g = p, or else g = p. □ 

The function defined by (6.6) is of the following important property, which 
guarantees that the limit of (6.5) exists. 

Lemma 15. If u G Pref(v), \u\ = m and \v\ = n, then 

G Pref(<$>-\v)). (7.3) 

Proof. Since w G Pref(v), we have w[l • • -m] = m[1 • • - m]. Thus by (6.6), we obtain 

*-\«) = A^^M) 

= A^^A-iA-* •••A~ r 1 ^(ufnl) 

ti[l---m— 1] v[m\ v[m+l\ v[n— X\ \ i J/ 

= A" ( l m " l) 1l (A- r 1 ^A-r 1 • ■ ■ A", 1 nfufn]))) 

= A-^-^^m^) 

= A; [ ( 1 m ^ 1 2 1] (n[m]) 2/ 

= ^\u)y, 

which suggests that (7.3) holds. □ 

With respect to the usual topology defined by 

d({u k ) k >x, {v k ) k >x) := 2~ min ^- u ^\ 
the pseudo-inverse operator is continuous because it preserves the prefix relation 
between two words. The following result is of independent interest. 

Lemma 16. (1) The pseudo- inverse operator commutes with the limit operator, that 
is, let v G E£, then v = lim V k and 

A-\ lim V k ) = lim A- 1 ^), where a G E n . (7.4) 

k— >oa k— ¥oa 

(2) Let w = uv, where u G E+, v G E£, then 
$- l (w) = A-^(^ 1 (v)). (7.5) 
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Proof. (1) Since Vk = v[l • ■ • k], we have 
A~\hmV k ) = A-\v[l ■■■}<■■■}) 

k— >oo 

= a" 1 6^.! 6^.2 ■ ' " > where a = bi 
= \\mA-\Vk). 

k— »oo 

(2) Let it = u[l ■ ■ ■ n] and v = v [1 • ■ ■ k ■ ■ ■ ], then by (7.4-7.5), we easily see that 
= hm A- u ^ti\v[k}) 

fc— >-oo 1 J 

= □ 

Finally, it is easy to get 

Lemma 17. 10 G is recurrent if and only if there are infinitely many prefixes 

Pre fn i+ ii W ) ° f W SUch tllat Pre fn l+1 ( W ) = X ' Pre fn z ( W ) " ^ ' Pre f n t ( W ) " Z > where 

x, y E E*, 2 G S w , < n i+ i for i = 1, 2, 3, • • • . □ 



8. Recurrence of smooth infinite words 

Brlek et al. proved that smooth infinite words are recurrent for 2-letter alphabets 
having same parity (see Proposition 15 and Theorem 25 in [4]). We now establish that 
smooth infinite words are recurrent for n-letter alphabets having the same remainder 
when divided by n and the following proof seems to be slightly more easily understood. 

Theorem 18. Let E n = {ai, a 2 , - ■ ■ , a n } with <2j = n-qi+r, where < r < n — 1, G 
M for % — 1, 2, • • ■ , n. Then every infinite smooth word over £„ is recurrent. 

Proof. Since $ is a bijection from C w to S w , by (6.5) and (6.8), for each infinite 
smooth word w, there is a u G S w such that 

u? = §~ l {u) 

= lim^f/fc). (8.1) 

fc— >oo 
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And note that U k = u[l • • • k], from (6.6) it immediately follows that 

We next divide the proof into two cases in view of the remainder r of a± , a 2 , ■ ■ ■ and a n 
when divided by n. And let b k . n+i = hi for k e N and z = 1, 2, • ■ ■ , n. 
Case 1. r = 0. Then by Lemma 14 (1), we have 



K^UH-^ ■ ■■b^i+n)}", where u k . 2 = h 



r a -(fc-4) / ft, 7 bi . . . ft 6i n 



*-(fc-4) / b i+lf b i+1 ,6 i+1 \ 
^[l-fe-^^fc-a^i+l u j-l+n) 



Uk-l 

, A 

-(fe-4) / 6j_i +n ,6i_i +Tl lii-i'-"^ 



A^4](%™^r • • -^rJ] - . where u k . 3 = b 3 

[A; [ ( i fc :£ 4 ]( u ^3)^iA; [ ( 1 fc :£ 4] (n fe _3)a; 2 • ■ ■ a- ( /t£ 4] K- 3 k]^ 

[$- 1 ([/ fe - 3 )^i^ 1 (^-3)x 2 • ■ ■ S^-sK]-, 



which suggests that w is recurrent by Lemma 17, where A u ^ ^^(u b k ^ l 3 b^[ ■ ■ ■ b"^[ +n 



\[i-k-4]( u k-3) x i for / = 0, 1, ■ ■ • , n - 1 and k = 5, 



6. 



Case 2. r > 0. Then, analogously, by Lemma 14 (1), we obtain 

= ^^(f/fc-Oa;]^^-^^!)^ (8.3) 



where A^frjU) = A^K^rr, A^'CuJU) = ^\u k ^)y. 

Case 2.1. If there are infinitely many u k such that u k > r, then there exist infinitely 

many u k such that (u k — r)/n > 1. Thus from (8.3) and Lemma 17 it immediately 
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follows that w is recurrent. 

Case 2.2. u = v • r w and r = b s , where v G S*. Then from Lemma 16 it follows that 

|(<&-V)). 



«; = A; 

Case 2.2.1. If r > 2, then by (8.4) and Lemma 16 (1), we get 
A^'l(lim A-^-^(r)) 



(8.4) 



k— >oo 

lim A;l u l(A-^-^(r)) 
lim A~ |u| (<]> _1 (r fe )). 

fc— >oo 



5.5) 



Let q be the smallest positive integer such that the number of runs of A r q (r) is larger 
than n. Then 



a; h ($-V)) 



X 

A; 



a; 

A; 



l(A r -( fc -^ 1 )(A^(r))) 
|( A -(fc- 9 -i)^ l6 t 2+i ...^ 

I^Cfc-?-!)^*!^ . . . &^_ 1 )A7 (fc -«- 1) (r t » +1 
l(A;^" 1 )(r))xA;^(A T T^ 1 )(r)) 2/ 
I ($-i ( r k - q ))x A^l (<5>~\r k - q ))y, 

where t{ G S n and XlILi ^* * s a mu hiple of n, 

A- |u| (A r T ( ^- 1) (r))x = A,-H(A r - (fe - 9 - 1) (r tl ^ + i • ■ ■ 6&W-1)), 
A-^^A"^- 1 ^^ ■■•)) = A^'CA^*"'" 1 ^))^ 
Thus (8.5) and (8.6) mean that u> is recurrent by Lemma 17. 
Case 2.2.2. If r = 1, then by (8.4) and Lemma 16 (1), we get 

a;H($-i ( i-)) 

A-H(i^-i( & - )) 



)) 



(8.6) 



w 



A;M(lim l*" 1 ^)) 
-l/'z.fc 



lim A;W(W-\b« +1 )). 



1.7) 



Thus 



a;'"'(i$- 1 ^ +1 )) 



a; 
a; 
a; 
a; 
a; 



(1A-M(& S+1 )) 
(lA^*- a) (6^)) 



((lA^ 2 ^6 s+ 0)x(lA- ( + fc i - 2) (6 s+1 ))) 

(i*- 1 (6^!" 1) ))yA7 H (i$- 1 (6^!" 1) )) ) 
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which still implies that w is recurrent by Lemma 17 and (8.7), where A b ^ k (b b s 3 +i 1 ) 
= Al%- 2 \b s+1 )xl and A~^(lA-^ 2 \b s+1 )x) = ^(l^(b^))y^D 

9. Primitive substitution and uniform recurrence 

A substitution a is a function from the alphabet S n into the set of nonempty 
words; it can be extended to a morphism of S* in a natural way by concatenation. 
A substitution a is primitive if there exists a positive integer k such that, for any a 
and b in £ n , the letter a occurs in a k {b). About the periodic points of a primitive 
substitution, we have the following simple fact. 

Proposition 19. ( [13] Proposition 1.2.3.) If a is primitive, then any of its periodic 
points is a uniform recurrent sequence. □ 

Brelk et al. [4] showed that smooth infinite words are recurrent for same parity 
alphabets, but we do not know wether or not smooth infinite words are uniformly 
recurrent for same parity alphabets. Note that S n is the alphabet with the cyclic 
order bib2 • • ■ b n . Now we consider the generalized Kolakoski word K clC2 ... Cn over E n 
with the base sequence B(K ClC2 ... Cn ) = (cic 2 ■ ■ ■ c n Y and 

Cj = qi ■ n + r for i — 1, 2, ■ ■ • , n. (9.1) 

For 2- letter alphabet S 2 = {ci, C2} having same parity, if c\ = 2m and c 2 = 2n, 
where m, n G A/", then K C1C2 is the fixed point of the following primitive substitution 
which was given by Sing [24]: 

A ^ A m B m 
B ^ A n B n , 

where A = c\ and B = c|. 

If c\ = 2m + 1 and c 2 = 2n + 1, where m < n G A/", then K C1C2 is the fixed point 
of the following primitive substitution /i which Sing constructed in [25]: 

A ^ A m BC m 
H : A m BC n 
C ^ A n BC n , 

where A = c\, B = C1C2 and C = c 2 . 
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Thus Proposition 19 gives the generalized kolakoski sequences are uniformly re- 
current for 2-letter alphabets having same parity. 

In this section, when r = 0, or r > and n is an even number, we construct a 
primitive substitution a of E„ such that K ClC2 ... Cn is the fixpoint of a. 
Case 1. r = in (9.1). 

In this case, the letters ci, c 2 , • ■ • , c n are all the multiples of n, then set Ai = cf 
for i — 1, 2, • • • , n, and the substitution a is given by 

a: A % ^ AfAf ■ ■ ■ A% for % = 1,2, ■ •• ,n. (9.2) 

Note that qi > 0, from (9.2) it immediately follows that a is both a primitive substi- 
tution and K ClC2 ... Cn = Hm^^ <r*(Ai), which means that K ClC2 ... Cn is a fixpoint of the 
primitive substitution (9.2). 

Case 2. r > in (9.1) and n is an even number with n = 2m, where m £ jV. 

Set Aj = c" for i = 1, 2, • • • ,n, B t — c r 2i _ x c r 2i for i — 1, 2, • • • , m. 
Case 2.1. r = 2/i, h £ A/". Then the corresponding substitution a are determined by 

4 /l<?2fc+l 75 AQ2k+l AQ2k+l r> AQ2k+l A<l2k+1 r> 4<?2fc+l 

^2fc+l ^ /1 2fc-r+l- Dfc - r + l/1 2fcT+2 /1 2fc-r+3- D fc-^+2 /1 2fc-r+4 ' ' ' A 2k-r+n-i n k-r+rn^ 2 k-r+i 
R u-!. Al2k+1 TJ Afl2k\\ AQ2k+l R Al2k+1 4<?2(fe+l) 

_ -Dfc+1 >~r n -2k-r+l n k-r+l^-2k-r+2 ' ' ' ^2(fc-r+h)-l^-r+^2(fc-r+/i) /1 2(fc-r+/i)+l 
R 4<?2(fc+l) ^92(fe+l) R A<l2(k+1) 

^>k-r+h+l^ 2 (k-r+h+l) ' ' ' /1 2(fc+l)r-l- D (fc+l)'- /1 2(fc+l)r 

A I v /1«2(fc+l) R /t92(fe+l) /|92(fe+l) R /l<?2(fc+l) 

^2(fe+l) ^ /i 2(fc+l)r+l jD (fc+l)^+l /1 2(fc+l)r+2 ' ' ' /1 2(fc+l)r+n-l jD (fe+l)r /i 2 (fe+l)r+n ' 

where fc = 0, 1, 2, • • • , m — 1, A t = Aj if i = j (mod n), B^ = Bj if i = j (mod m). 

For example, if E = {2,6, 10, 14} then n = 4,r = 2. Thus A 1 = 6 4 , A 2 = 10 4 , 
A 3 = 14 4 , A 4 = 2 4 , Bi = 6 2 10 2 , B 2 = 14 2 2 2 . From (9.3) it follows that 

A 1 m- A 1 B 1 A 2 A 3 B 2 A 4 
Bt ^ A X B X A 2 A\B 2 A\ 

<Ti : 112 3 2 4^ g 4 

A 3 ^ A\B x AlAlB 2 Al 
B 2 ^ A\B X A\B 2 
Ai H- B X B 2 

By (9.4), we have 

A x ^ 6 6 10 6 14 6 2 6 ^ 6 6 10 6 14 6 2 6 6 6 10 6 14 10 2 10 6 10 10 10 14 10 2 10 

6 14 10 14 14 14 2 14 6 14 10 14 14 2 2 2 6 2 10 2 14 2 2 2 ^ . . . 

= 6 6 10 6 14 6 2 6 6 6 10 6 14 10 2 10 6 10 10 10 14 10 2 10 

6 14 10 14 14 14 2 14 6 14 10 14 14 2 2 2 6 2 10 2 14 2 2 2 . . . = J^^. (9.5) 
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From (9.4) it easily follows that for any u and v in the alphabet {2,6,10,14}, 
u must occur in cr 3 (v), which suggests that a is a primitive substitution over the 
alphabet {2,6,10,14}. Moreover, from (9.5) we have i^6(io)(i4)2 = lim^ff^i!). 
Thus -^6(10) (14)2 is a fixpoint of the primitive substitution (9.4). 

In general, note that at most one of qi, q 2 , ■ ■ ■ and q n can take the value 0, from 
(9.3) we easily see that for any u and v in the alphabet {A, A 2 , . . . , A n , Bi, B 2 , • • ■ , B m }, 
u must occur in a 3 (v), which suggests that a is a primitive substitution and K clC2 ... Cn = 
hm^oo a l (A\). Therefore K ClC2 ... Cn is a fixpoint of the primitive substitution (9.3). 
Case 2.2. r = 2h + l, h 6 N. Then the corresponding substitution a are determined 
by 

A i_i Al'2k+1 r> Al2k+1 Al2k+l p Al2k+1 Al2k+l r> Al2k+1 

^-2k+l i-r /1 2fe-r+l- D fc-r-+l /1 2fc-r+2 /1 2fc-r+3- D *:^+2 /1 2fc-r+4 ' ' ' /1 2fc-r+n-l- D fc-r+m^2fc-r+n 

f> ,_v A<l2k+1 f> Al2k+1 Al2k+1 f> Al2k + 1 Al2k+1 

n k+1 i-f ^ 2k . r+1 ^->k-r+l^- 2k . r+2 ' ' ' ^2(fc-r+/i)-l^-r+^2(A>r+fr.) /1 2(fc-r+h)+l 
R A<l2{k + 1) Al2(k+1) R A<l2(k+1) 

£>k-r+h+l^ 2 (k-r+h+l) ' ' ' /1 2(fc+l)r-l- D (fc+l)'' /1 2(fe+l)r- 



Si2(k+1) ^ 2 (fc+l)r+l- D (fc+l)'-+l /1 2(fc+l)r+2 ' ' ^ i 2(fe+l)r+n-l- D (fe+l)^ /1 2(fe+l)r-+n ' 



where fc = 0, 1, 2, • • ■ ,m — 1, Aj = Aj if z = j (mod n), Bi = Bj if z = j (mod m). 

For example, if X = {1,5,9,13} then n = 4,r = 1. Thus At = 5 4 , A 2 = 9 4 , 
A 3 = 13 4 , A 4 = l 4 , 5i = 59, B 2 = (13)1. From (9.6) it follows that 

1 y A 1 B 1 A 2 A 3 B 2 A 4 

B 1 ^ A X B X A\ 

A 2 h. A\B 2 A\A\B X A\ 
<J 2 : . 9.7 

A3 h. Ap 2 AiA?5iA| 

fi 2 h+ A 3 5 2 

In view of (9.7), we get 

A i-» 5 5 9 5 13 5 1 5 h-> 5 5 9 5 13 5 1 5 5 5 9 9 13 9 1 9 5 9 9 9 13 13 1 13 5 13 9 13 13 13 159(13)1 h-> • • ■ 

'59(13)1- 



5 5 9 5 13 5 1 5 5 5 9 9 13 9 1 9 5 9 9 9 13 13 1 13 5 13 9 13 13 13 159(13)1 • • ■ = K 5 



By an argument similar to Case 2.1, from (9.6) we obtain that K ClC2 ... Cn is a fixpoint 
of the primitive substitution (9.6). 

Up to now, we have proved that if r = or r > and n is an even number, then 
K clC2 ... Cn is a fixpoint of the primitive substitution a. Thus from Proposition 19 it 
immediately follows that K clC2 ... Cn is uniformly recurrent, which suggests that 
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Theorem 20. Let n be an integer larger than 1, and ai = qi ■ n + r for % = 

I, 2, • • • , n, < r < n — 1. Ifr = 0orr>0 and n is an even number, then 
the generalized Kolakoski words over the n-letter alphabet S n with the cyclic order 
bib2 ■ • • b n are the uniform recurrent sequences. □ 

10. Reversal 

Brlek et al. proved that the set F(w) is closed under reversal for 2-letter odd alphabets 
by [4] Lemma 11. For n-letter alphabet E n with the cyclic order bib2 • • • b n , if n > 2, 
then the reversal of any factor of bib2 ■ ■ • b n of length larger than 1 is not compatible 
with bib2 ■ ■ ■ b n . Thus the reversal of any factor of a smooth infinite word w with 
the number of runs being larger than 1 must be not again a factor of w. Thus closure 
property of reversal of factors of smooth infinite words is of meaningful only for 2- 
letter alphabets. Since the proof of [4] Lemma 11 seems to be somewhat complicated, 
we now give a different proof, which is slightly more explicit. 

Theorem 21. ([4] Proposition 15) Let wbea smooth infinite word over the alphabet 
S 2 . If ai, <22 are odd integers, then the set F(w) is closed under reversal. 

Proof. Since w = for some u £ by (6.7), w = lim $ -1 (tt[l • • • k}). 

Note that u[k] is a palindrome of length 1 and a, b are odd integers. So by (6.6) and 
Lemma 12 (3), we see that $ _1 (u[l •••&]) = A^A"^ ■ • • A~^ k _^(u[k]) is a palindrome 
of odd length. Since for every / £ F(w), there is a positive integer m such that 
$ _1 (u[l ■ ■ -m\) = gfq, so $ _1 (u[l ■ • -m]) = $ _1 (w[l • ■ -m]) = qfg, that is, / £ F(w), 
which guarantees that F(w) is closed under reversal. □ 

II. Concluding remarks 

Brelk et al. [4] showed that smooth infinite words are recurrent for 2-letter alphabets 
having same parity. In the section 8, we show that smooth infinite words are recurrent 
for n-letter alphabets S n having same remainder when divided by n. In the section 9, 
we establish that the generalized Kolakoski words over the n-letter alphabet S n are 
uniformly recurrent except for the case r > and n being an odd integer. 

Thus for the alphabets E n having same nonzero remainder when divided by n 
and n being a positive odd integer, to determine whether or not the generalized Ko- 
lakoski words are uniformly recurrent is a fascinating problem. In general, to ascertain 
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whether smooth infinite words over n-letter alphabet E n are uniformly recurrent also 
deserves further investigations. Similarly, letter frequency and permutation invariant 
property of smooth infinite words also merits further explorations. 

Moreover, by corollary 9, if n \ di for i = 1, 2, • • • , n, then the letter frequency of 
generalized Kolakoski words is 1/n for the alphabet S n with given cyclic order. 

In addition, let n = 2m, a, = n ■ qi + r for i = 1, 2, • ■ ■ , n and r — m. And if 
we could construct a primitive substitution a of constant length n over the alphabet 
{A 1 , A 2 , . . . , A n , B 1 , B 2 , • ■ ■ , B m }, then from [22] Proposition V.9. it follows that each 
letter in the fixed point u = lim^oo o~ l {Ai) of er, occurs in u with a positive frequency. 
Let Pi be the frequency of A± occurring in u for % = 1, 2, • • • , n and qi be the frequency 
of Bi occurring in u for i = 1, 2, • • • , m, then by Aj = c™ and Bj = c^^c^, we see 
that the letter C2j_i occurs exactly in A 2 i-\ and -Bj, and c 2 % occurs exactly in A 2 i and 
Bi. Thus the frequency of c 2 j_i occurring in K ClC2 ... Cn is equal to p2i-i + 1/2 • q iy and 
the frequency of c 2 i occurring in K ClC2 ... Cn is equal to p 2 i + 1/2 ■ q^ for z = 1, 2, • • • , m. 
Therefore we could arrive at the following attractive result: 

For the n-letter alphabet S n with the cyclic order, if r > 0, n = 2m and r = m, 
the letters occurred in the generalized Kolakoski words have the positive frequency. 

Thus the following open problem is very significative. 

Prove (or disprove) that for the generalized Kolakoski sequence K u over n- 
letter alphabet S n , there exists a primitive substitution a of length constant over the 
alphabet {Ai, A 2 , . . . , A n , B\, B 2 , • ■ ■ , B m } such that K u is a fixed point of a. 

References 

1. M. Baake, B. Sing, Kolakoski- (3,1) is a (deformed) model set, Canad. 
Math. Bull, 47 (2) (2004), 168-190. 

2. V. Berthe, S. Brlek, P. Choquette, Smooth words over arbitrary 
alphabets, Theoretical Computer Science, 341 (2005), 293-310. 

3. S. Brlek, S. Dulucq, A. Ladouceur, L. Vuillon, Combinatorial properties 
of smooth infinite words, Theoretical Computer Science, 352 (2006), 306-317. 

4. S. Brlek, D. Jamet, G. Paquin, Smooth words on 2-letter alphabets having 
same parity, Theoretical Computer Science, 393 (2008), 166-181. 

5. S. Brlek, A. Ladouceur, A note on differentiate Palindromes, Theoretical 
Computer Science, 302 (2003), 167-178. 



23 



6. S. Brlek, G. Melangon , G. Paquin, Properties of the extremal infinite 
smooth words, Discrete Mathematics and Theoretical Computer Science, 
DMTCS , 9 (2) (2007), 33-50. 

7. A. Carpi, Repetitions in the Kolakovski sequence, Bull, of the EATCS, 50 

(1993) , 194-196. 

8. A. Carpi, On repeated factors in C°° -words, Information Processing Letters, 
52 (6) (1994), 289-294. 

9. V. Chvatal, Notes On the Kolakoski sequence, DIMACS Tech. Rep., 93-84 

(1994) . 

10. F. M. Dekking, Regularity and irragularity of sequences generated by 
automata, Seminaire de Theorie des Nombres de Bordeaux, 1979-80, expose 
n° 9, 901-910. 

11. F. M. Dekking, On the structure of self generating sequences, Seminaire de 
Theorie des Nombres de Bordeaux, 1980-81, expose n° 31, 3101-3106. 

12. F. M. Dekking, What is the long range order in the kolakoski sequence? in 
R.V.Moody (ed.), The mathematics of Long- Range Aperiodic order, Kluwer 
Academic Publishers (1997) 115-125. 

13. Pytheas Fogg, Substitutions in Dynamics, Arithmetics and Combinatorics, 
Lecture Notes in Mathematics 1794, Springer, 2002. 

14. Y. B. Huang, About the number of C°° -words of form wxw, Theoretical 
Computer Science, 393 (2008), 280-286. 

15. Y. B. Huang, The complexity of C 1 ^ -words of the form wxw, Theoret. 
Comput. ScL, 410 (2009), 4892-4904. 

16. Y. B. Huang, W. D. Weakley, A note on the complexity of C°° -words, 
Theoret. Comput. Sci., 411 (2010), 3731-3735. 

17. M. S. Keane, Ergodic theory and subshifts of finite type, in: Ergodic Theory, 
Symbolic Dynamics and Hyperbolic Spaces, T. Bedford, M. Keane, C. Series 
(Eds.), Oxford University Press (Oxford 1991), 350-370. 



24 



18. C. Kimberling, Problem 6287, Amer. Math. Monthly, 86 (1979), 35-70. 

19. W. Kolakoski, Self-genetating runs, Problem 5304, Amer. Math. Monthly, 
72 (1965), 674. Solution: Amer. Math. Monthly 73 (1966), 681-682. 

20. A. Lepisto, Repetitions in the Kolakoski sequence, Development in Language 
Theory (1993), 130-143. 

21. G. Paun, How much Thue is Kolakovski? Bull, of the EATCS, 49 (1993), 
183-185. 

22. M. Queffelec, Substitution Dynamical Systems- Spectral Analysis, Lecture 
Notes in Mathematics 1294 (1987), Springer- Verlag, 94. 

23. R. Steacy, Structure in the Kolakoski sequence, Bull, of the EATCS, 59 
(1996), 173-182. 

24. B. Sing, Kolakoski sequences - an example of aperiodic order, J. Non-Cryst. 
Solids, 334-335 (2004), 100-104. 

25. B. Sing, Kolakoski- (2m, 2n) are limit-periodic model sets, J. Math. Phys. 44 
(2) (2003), 899-912. 

26. B. Steinsky, A Recursive Formula for the Kolakoski Sequence, Journal of 
Integer Sequences, 9 (2006), Article 06.3.7. 

27. W. D. Weakley, On the number of C°° -words of each length, Jour, of comb. 
Theory, Ser.A, 51 (1989), 55-62. 



25 



